National Repository of Grey Literature 132 records found  1 - 10nextend  jump to record: Search took 0.01 seconds. 
Implementation of Simple Speech Recognizer in a Web Browser
Crkoň, Jakub ; Glembek, Ondřej (referee) ; Szőke, Igor (advisor)
The goal of this project is to implement simple speech recognizer for web browser. This paper describes fundamental components required for implementing speech recognizer and techniques which are used for optimization process of speech recognition in web browser. At first, the paper focuses on introduction of speech recognition theory. It describes individual parts and principles of speech recognizer. In next section, thesis describes design, implementation and principles of acceleration of speech recognizer with limited computing resources of web browser. The implementation is divided into modules making up the library for usage in web browser. The library is easily extendable and usable in various web applications. Finally, it discusses potential directions of development and usability of this project.
Speech Technology Application in Pronunciation Training and Foreign Language Learning
Barotová, Štěpánka ; Žmolíková, Kateřina (referee) ; Szőke, Igor (advisor)
Tato diplomová práce pojednává o využití algoritmu Dynamic Time Warping (DTW) pro automatické hodnocení výslovnosti anglického jazyka. Práce se zaměřuje na vylepšení již existující aplikace pro výuku výslovnosti, a to ve třech oblastech: uživatelské rozhraní, samotný algoritmus a korektivní zpětná vazba uživateli. První část se věnuje přehledu technik používaných v této oblasti, následně je představen nový design uživatelského rozhraní, popsán navržený systém a experimenty. Experimenty se zaměřují na problematiku detekce chyb na úrovni fonémů, na detekci chyb v primárním důrazu na úrovni slabik a na hodnocení intonace na úrovni slov. Všechny použité metody jsou navrženy tak, aby poskytovaly korektivní zpětnou vazbu uživateli. V poslední části je popsáno, jak byly všechny tři vylepšené oblasti aplikace otestovány.
Recognition of Isolated Words for Electronic Dictionaries
Hrdlička, Pavel ; Szőke, Igor (referee) ; Grézl, František (advisor)
This work is concerned with creation of isolated word recognizer for electronic dictionaires, testing its functionality on data sample and improvement by normalisation and speaker adaptation techniques. Word recognizer is built on HTK (Hidden Markov Model Toolkit). At the beginning of this document, the main aims of the work are set. In the next chapter is theoretical analysis, which describes process of recognition of isolated words with hidden Markov models. Next chapter specifies the speech data, which were used for testing. Other resources for building recognizer, like models, dictionary and grammar are described in next chapter. Before creation of recognizer, it was necessary to solve conversion between the phonemes set which was used in dictionary and set, which uses the recognizer. The recognizer was built with 8~kHz models first, than 16~kHz models were also used. Normalisation and speaker adaptation techniques were used. Obtained data were processed and results are analyzed in separate chapter. Finally is discussed, if the goals of the work were reached and what are the next steps of application development.
Integration of Voice Technologies on Mobile Platforms
Černičko, Sergij ; Černocký, Jan (referee) ; Schwarz, Petr (advisor)
The goal of the thesis is being familiar with methods a techniques used in speech processing. Describe the current state of research and development of speech technology. Project and implement server speech recognizer that uses BSAPI. Integrate client that will use server for speech recognition to mobile dictionaries of Lingea company.
Voice Recording and Search for Skype
Nytra, Jiří ; Szőke, Igor (referee) ; Schwarz, Petr (advisor)
This work deals with the creation of a program communicating with Skype, which provides record calls in which can search for keywords by using advanced speech recognition technology. The work is presented and the interface protocol to communicate with Skype, call recording and method LVCSR for searching keywords.
Penetration Tests of Speaker Verification System
Wojnar, Filip ; Landini, Federico Nicolás (referee) ; Plchot, Oldřich (advisor)
Cílem práce je provést penetrační testy na systému pro automatickou verifikace řečníka za použití syntézy řeči. Práce se zabývá fungování systému pro automatickou verifikaci řečníka a spoofing útoky na systémy, zabývající se touto problematikou. Práce se také podrobnějí zabývá fungováním syntézy řeči. Pozdější kapitoly se zabývají realizací penetračních testů a výsledky, které nám tyto testy přinesly.
Parallel Training of Neural Networks for Speech Recognition
Veselý, Karel ; Fousek, Petr (referee) ; Burget, Lukáš (advisor)
This thesis deals with different parallelizations of training procedure for artificial neural networks. The networks are trained as phoneme-state acoustic descriptors for speech recognition. Two effective parallelization strategies were implemented and compared. The first strategy is data parallelization, where the training is split into several POSIX threads. The second strategy is node parallelization, which uses CUDA framework for general purpose computing on modern graphic cards. The first strategy showed a 4x speed-up, while using the second strategy we observed nearly 10x speed-up. The Stochastic Gradient Descent algorithm with error backpropagation was used for the training. After a short introduction, the second chapter of this thesis shows the motivation and introduces the neural networks into the context of speech recognition. The third chapter is theoretical, the anatomy of a neural network and the used training method are discussed. The following chapters are focused on the design and implementation of the project, while the phases of the iterative development are described. The last extensive chapter describes the setup of the testing system and reports the experimental results. Finally, the obtained results are concluded and the possible extensions of the project are proposed.
Unsupervised Adaptation of Speech Recognizer
Švec, Ján ; Karafiát, Martin (referee) ; Schwarz, Petr (advisor)
The goal of this thesis is to design and test techniques for unsupervised adaptation of speech recognizers on some audio data without any textual transcripts. A training set is prepared at first, and a baseline speech recognition system is trained. This sistem is used to transcribe some unseen data. We will experiment with an adaptation data selection process based on some speech transcript quality measurement. The system is re-trained on this new set than, and the accuracy is evaluated. Then we experiment with the amount of adaptation data.
Using of voice for home eguipment control
Bubla, Lukáš ; Vojáček, Zdeněk (referee) ; Lacko, Branislav (advisor)
The bachelor thesis is focused on home automation, specifically on voice control. It describes methods of communication and control of automated systems and especially problematics of computer voice recognition using different methods. The thesis displays specific product for voice controlling developed by Czech company Elko ep, s.r.o. and then briefly looks at the issue of gesture control. At the end compares product from Elko ep with other products from foreign companies and discuss advantages and disadvantages of voice or gestures control.
Speech Recognition For Selected Languages
Schmitt, Jan ; Karafiát, Martin (referee) ; Janda, Miloš (advisor)
This bachelor's thesis deals with recognition of continues speech for three languages - Bulgarian, Croatian and Swedish. There are described basics of speech processing and recognition methods like acoustic modeling using hidden Markov models and gaussian mixture models. Another aim of this work is preparing data for those languages from GlobalPhone database, so they may be used with speech recognition toolkits Kaldi and HTK. With data prepared there are several models trained and tested using Kaldi toolkit.

National Repository of Grey Literature : 132 records found   1 - 10nextend  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.